A Quantitative Model of Yorùbá Speech Intonation Using Stem-ML
Abstract
We present a quantitative model of Standard Yorùbá (SY) intonation whose parameters are designed to be linguistically interpretable. The model is built and trained on speech data from a native speaker of SY. It reproduces the data well: its root-mean-square prediction error (RMSE) is 14.00 Hz on a test set. We find that intonation is used to mark sentence and phrase boundaries: beginning syllables are systematically stronger, and ending syllables systematically weaker, than medial syllables. The M tone is the strongest and the H tone the weakest, though the differences are modest. We see comparable amounts of carry-over and anticipatory co-articulation. The SY model shows characteristics similar to those of Mandarin and Cantonese intonation models.
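For readers unfamiliar with the error measure quoted in the abstract, the following is a minimal sketch of how an RMSE in Hz can be computed between observed and predicted f0 values. The function name `rmse_hz` and the sample values are hypothetical and chosen only for illustration; they are not the data or the evaluation code used for the SY model.

```python
import math


def rmse_hz(observed, predicted):
    """Root-mean-square error between two equal-length f0 sequences (Hz)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("sequences must be non-empty and of equal length")
    # Average the squared per-sample errors, then take the square root.
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical f0 samples in Hz, invented for illustration only.
observed_f0 = [120.0, 135.0, 150.0, 140.0]
predicted_f0 = [110.0, 145.0, 160.0, 130.0]

print(rmse_hz(observed_f0, predicted_f0))  # every error is 10 Hz, so this prints 10.0
```

Because the errors are squared before averaging, a few large deviations raise the RMSE more than many small ones, so a single figure such as 14.00 Hz summarizes the typical size of the prediction error rather than its worst case.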
Similar resources
Automated modeling of Chinese intonation in continuous speech
We built and trained a model of intonation in continuous Mandarin speech based on the Stem-ML model of interacting accents. With this model, we found that we can accurately reproduce the intonation of the speaker using only one accent template for each lexical tone category. The resulting parameters are interpretable, and we find that the fitted model is consistent with linguistic expectations....
Automated modelling of Chinese intonation in continuous speech
We built and trained a model of intonation in continuous Mandarin speech based on the Stem-ML model of interacting accents. With this model, we found that we can accurately reproduce the intonation of the speaker using only one accent template for each lexical tone category. The resulting parameters are interpretable, and we find that the fitted model is consistent with linguistic expectations....
Hierarchical Structure and Word Strength Prediction of Mandarin Prosody
We use Stem-ML to build an automatic learning system for Mandarin prosody that allows us to make quantitative measurements of prosodic strengths. Stem-ML is a phenomenological model of the muscle dynamics and planning process that controls the tension of the vocal folds. Because Stem-ML describes the interactions between nearby tones or accents, we were able to use a highly constrained model wi...
Quantitative measurement of prosodic strength in Mandarin
We describe models of Mandarin prosody that allow us to make quantitative measurements of prosodic strengths. These models use Stem-ML, which is a phenomenological model of the muscle dynamics and planning process that controls the tension of the vocal folds, and therefore the pitch of speech. Because Stem-ML describes the interactions between nearby tones, we were able to capture surface tonal...
F0 stylization and intonation modelling for Standard Yorùbá Text-to-speech application
This technical report documents experiments on stylization of the f0 curve on Standard Yorùbá (SY) syllables as well as a technique for intonation modelling. A number of interpolation polynomials were evaluated using root mean square error and mean opinion score techniques. The stylization experiment resulted in the selection of a third-degree polynomial for modelling the f0 curves on Yorùbá syll...
Publication date: 2007